Variational Inference for Crowdsourcing
Abstract
Crowdsourcing has become a popular paradigm for labeling large datasets. However, it has given rise to the computational task of aggregating the crowdsourced labels provided by a collection of unreliable annotators. We approach this problem by transforming it into a standard inference problem in graphical models, and applying approximate variational methods, including belief propagation (BP) and mean field (MF). We show that our BP algorithm generalizes both majority voting and a recent algorithm by Karger et al. [1], while our MF method is closely related to a commonly used EM algorithm. In both cases, we find that the performance of the algorithms critically depends on the choice of a prior distribution on the workers’ reliability; by choosing the prior properly, both BP and MF (and EM) perform surprisingly well on both simulated and real-world datasets, competitive with state-of-the-art algorithms based on more complicated modeling assumptions.
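As a point of reference for the aggregation task described above, the following is a minimal sketch of two baselines the abstract mentions: majority voting, and an EM procedure for a simple "one-coin" worker model in which each worker is correct with an unknown probability. It is not the paper's BP or MF algorithm. The function names, the {-1, 0, +1} label encoding (0 meaning "no label"), and the Beta(2, 1) prior on worker reliability are all assumptions made for illustration.

```python
import numpy as np

def majority_vote(L):
    """L[i, j] in {-1, 0, +1}; 0 means worker j did not label task i."""
    s = L.sum(axis=1)
    return np.where(s >= 0, 1, -1)  # ties are broken toward +1

def one_coin_em(L, alpha=2.0, beta=1.0, n_iter=50):
    """EM sketch for a one-coin model: worker j is correct with prob. q[j].

    alpha, beta parameterize an assumed Beta prior on q[j]; the M-step
    takes the mode of the resulting Beta posterior (a MAP estimate).
    """
    mask = (L != 0)
    pos = (L == 1).sum(axis=1)
    cnt = mask.sum(axis=1)
    # Initialize P(z_i = +1) with the fraction of +1 votes per task.
    mu = np.where(cnt > 0, pos / np.maximum(cnt, 1), 0.5)
    for _ in range(n_iter):
        # M-step: expected number of correct labels given by each worker.
        correct = (mu[:, None] * (L == 1)
                   + (1 - mu)[:, None] * (L == -1)).sum(axis=0)
        total = mask.sum(axis=0)
        q = (correct + alpha - 1) / (total + alpha + beta - 2)
        q = np.clip(q, 1e-6, 1 - 1e-6)
        # E-step: posterior log-odds of z_i = +1 under a uniform label prior.
        w = np.log(q / (1 - q))
        mu = 1.0 / (1.0 + np.exp(-(L @ w)))
    return np.where(mu >= 0.5, 1, -1), q
```

With a uniform prior on the true labels, the E-step reduces to a weighted vote in which each worker's weight is the log-odds of their estimated reliability; majority voting corresponds to giving every worker the same positive weight. The prior parameters control how strongly workers are assumed to be reliable before any labels are seen, which is the kind of choice the abstract identifies as critical.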
Similar resources
A Derivation of the Belief Propagation Algorithm
This document contains derivations and other supplemental information for the NIPS 2012 submission, "Variational Inference for Crowdsourcing". We derive the belief propagation algorithm (15) in Theorem 3.1.
Fast Inference for Interactive Models of Text
Probabilistic models are a useful means for analyzing large text corpora. Integrating such models with human interaction enables many new use cases. However, adding human interaction to probabilistic models requires inference algorithms which are both fast and accurate. We explore the use of Iterated Conditional Modes as a fast alternative to Gibbs sampling or variational EM. We demonstrate sup...
Early Gains Matter: A Case for Preferring Generative over Discriminative Crowdsourcing Models
Introduction. Here we derive mean field variational updates for MOMRESP. Although this derivation is largely a mechanical exercise, it is our belief that there is a contingent of crowdsourcing practitioners whose background is more practical than theoretical and who may appreciate seeing the mechanics of mean-field variational inference presented in a high level of detail for a model they are f...
Crowd-Selection Query Processing in Crowdsourcing Databases: A Task-Driven Approach
Crowd-selection is essential to crowdsourcing applications, since choosing the right workers with particular expertise to carry out specific crowdsourced tasks is extremely important. The central problem is simple but tricky: given a crowdsourced task, who is the right worker to ask? Currently, most existing work has mainly studied the problem of crowd-selection for simple crowdsourced tasks su...
Inference-Rules via Crowdsourcing
The importance of inference rules to semantic applications has long been recognized, and extensive work has been carried out to automatically acquire inference-rule resources. However, despite their potential, the utilization of inference rule resources is currently somewhat limited, in part due to the considerable number of rules which are in fact invalid. A possible solution to this problem i...